# AudioSet Fine-tuning

| Model | License | Description | Tags | Author | Downloads | Likes |
| --- | --- | --- | --- | --- | --- | --- |
| Vit Base Patch16 1024 128.audiomae As2m Ft As20k | Not listed | A Vision Transformer (ViT)-based audio processing model, pre-trained on AudioSet-2M using the self-supervised masked autoencoder (MAE) method and fine-tuned on AudioSet-20k. | Audio Classification | gaunernst | 335 | 2 |
| Ast Finetuned Audioset 16 16 0.442 | Bsd-3-clause | An Audio Spectrogram Transformer (AST) fine-tuned on the AudioSet dataset, utilizing a vision transformer architecture to process audio spectrograms, achieving excellent performance in audio classification tasks. | Audio Classification, Transformers | MIT | 35 | 1 |
| Ast Finetuned Audioset 14 14 0.443 | Bsd-3-clause | An Audio Spectrogram Transformer (AST) fine-tuned on the AudioSet dataset, which converts audio into spectrograms and processes them using a vision transformer architecture, achieving excellent performance in audio classification tasks. | Audio Classification, Transformers | MIT | 194.20k | 5 |
| Ast Finetuned Audioset 12 12 0.447 | Bsd-3-clause | An Audio Spectrogram Transformer (AST) fine-tuned on the AudioSet dataset, using ViT architecture to process audio spectrograms, achieving excellent performance on multiple audio classification benchmarks. | Audio Classification, Transformers | MIT | 25 | 0 |
| Ast Finetuned Audioset 10 10 0.448 V2 | Bsd-3-clause | An Audio Spectrogram Transformer (AST) fine-tuned on the AudioSet dataset, which converts audio into spectrograms and processes them using a vision transformer, excelling in audio classification tasks. | Audio Classification, Transformers | MIT | 2,072 | 0 |
| Ast Finetuned Audioset 10 10 0.448 | Bsd-3-clause | An Audio Spectrogram Transformer (AST) fine-tuned on the AudioSet dataset, utilizing a vision transformer architecture to process audio spectrograms, achieving excellent performance in audio classification tasks. | Audio Classification, Transformers | MIT | 326 | 0 |
| Ast Finetuned Audioset 10 10 0.4593 | Bsd-3-clause | An Audio Spectrogram Transformer (AST) fine-tuned on AudioSet, which converts audio into spectrograms and applies a vision transformer for audio classification. | Audio Classification, Transformers | MIT | 308.88k | 311 |
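
The AST entries above differ mainly in the two integers in their names. The following is a minimal sketch of what those integers plausibly control, under assumptions that this listing does not state: that they are the frequency and time strides used when cutting the spectrogram into 16×16 patches, and that the input spectrogram has 128 mel bins and 1024 frames. The function name `patch_grid` and its parameters are hypothetical, chosen only for illustration.

```python
def patch_grid(freq_stride, time_stride, n_mels=128, n_frames=1024, patch=16):
    """Count the patches a ViT-style model would see on a spectrogram.

    Assumptions (not taken from the listing): square patches of size
    `patch`, no padding, and a spectrogram of n_mels x n_frames.
    Returns (patches along frequency, patches along time, total patches).
    """
    if freq_stride <= 0 or time_stride <= 0:
        raise ValueError("strides must be positive")
    if n_mels < patch or n_frames < patch:
        raise ValueError("spectrogram is smaller than one patch")
    # Standard sliding-window count: one patch at offset 0, then one
    # more for every full stride that still fits inside the axis.
    n_freq = (n_mels - patch) // freq_stride + 1
    n_time = (n_frames - patch) // time_stride + 1
    return n_freq, n_time, n_freq * n_time


if __name__ == "__main__":
    # Strides matching the "16 16", "14 14", "12 12", "10 10" name suffixes.
    for stride in (16, 14, 12, 10):
        print(stride, patch_grid(stride, stride))
```

Under these assumptions, a stride of 16 yields 8 × 64 = 512 non-overlapping patches, while a stride of 10 yields 12 × 101 = 1212 overlapping patches, so smaller strides give the transformer a longer input sequence to process.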